column (issue 65 p446) about how to call DLLs 
from within Visual Basic, so I’m not going to 
cover the same ground again.) The library is 
called Speech.dll, and it contains two functions 
described as: 


int FAR PASCAL Say(LPSTR IpEnglishString); _| 
which will say the text in |pEnglishString, and: 


int FAR PASCAL SetSayGlobal(int Reserved1, int Reserved2, 
int Pitch, int Speed, int Volume); _| 


which will set the speech reproduction to the 
specified pitch, speed and volume. The control 
over pitch, speed and volume is unique to the 
DLL approach. 

Translating these into Viswal Basic external 
Declares is fairly easy as long as you’re not put off 
by the mystical-sounding FAR PASCAL. This is 
just the standard method of calling a Windows 
DLL. So to use the Say function you need the 
Declare: 


Declare Function Say Lib “Speech.dll" (ByVal text As String) 
As Integer. 


in the declarations section. After this you can 
include calls such as: 


result =Say("Do you want me to open the pod bay doors 
now Dave?") | 


and Monologue will say the text for you. Similarly, 
if you include the Declare: 


Declare Function SetSayGlobal Lib "Speech.dil" | 
(ByVal resi As Integer, ByVal res2 As Integer, | 
ByVal Pitch As Integer, ByVal Speed As Integer, _| 
ByVal Volume As Integer) As Integer! 


you can set the pitch, speed and volume of the 
speech. 

It’s fun hearing Visual Basic applications 
talk, but the facility has obvious serious applica- 
tions too. When you think how recently speech 
synthesis of this quality needed special hard- 
ware, it makes you realise that even now the 
momentum hasn’t gone from the microcomput- 
er revolution. 


Alternative compression 


After describing in these pages the 
») compression algorithm used for 

©) non-lossy disk compression, as typi- 
@ fied by Stacker and DoubleSpace, 
and lossy compression, as used in JPeg image 
compression, I thought you might like to meet 
two other very clever compression ideas. The 
first is an impossible compression algorithm that 
could simply never be made to work; the second, 
although it seems to be very similar to the first, is 
in fact extremely practical. 


@ Ratio compression 

Suppose you want to create a highly compressed 
form of the entire British Library, physically 
small enough to be carried in your pocket. You 
could start by keying in every book using stan- 
dard Ascii code for the text and any reasonable 
representation for the graphics. Next, consider 
the entire string of ordered bytes that result as a 
fixed-point number with the decimal point to the 
far right. The binary fraction that results is equiv- 
alent to the decimal fraction A/B. Now for the 
final coding step: take a 1cm rod of metal and 
place a mark on it that divides the rod into the 
same ratio, A:B. 
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You've now coded the entire collection of the 
British Library as a single mark on a lcm rod. 
The reasoning is perfectly good in theory. But in 
practice? 


@ Arithmetic coding 

Arithmetic coding is very similar to optimal 
Huffman coding in its aims, but it works better in 
practice. Huffman’s limitation is that where the 
theoretical number of bits needed to code some- 
thing is fractional, the actual number of bits used 
still has to be whole. If in theory it needs 1.5 bits 
to optimally code a symbol, the best Huffman can 
do is to assign it two bits. Fractional coding real- 
ly can assign it 1.5 bits, thus avoiding waste. 

To use arithmetic coding, you first need to 
know the probability of each symbol occurring 
(as you do with a Huffman code). In Huffman 
coding you assign the shortest binary codes to 
the most probable symbols. In arithmetic coding 
you assign a portion of the range 0 to <1 to each 
symbol such that it’s proportional to its probabil- 
ity. For example, if the letter ‘S’ has a probability 
of 0.1 of occurring you might assign it to the 
range x>=0x<0.1; if‘T’ has a probability of 0.2 you 
might assign it to x>=0.1 x<0.3; and so on. Each 
symbolis assigned to a non-overlapping range to 
produce a lookup table. 

How is this used to code a sequence of sym- 
bols? Well, the first symbol is used to look up the 
range in the table. This range codes the symbol, 
in the sense that quoting it or just the lower 
bound of the range is enough to tell you which 
symbol should be decoded. Next the second 
symbol is looked up in the table, but now the 
resultis taken to give a further division of the first 
symbol’s range (see Figure 1). 

For example, if the first symbol was a “T’ the 
range would be x>=0.1 x<0.3. If the second sym- 
bol was an ‘S’, this would divide the ‘T’ range up 
in the same way that x>=0.0 
x<0.1 divides up 0 to <1. A little 
arithmetic gives the new range 
as x>=0.10 x<0.12. As each sym- 
bol is coded, it further restricts 
the range selected by the previ- 
ous symbols. Each time, the cur- 
rent range is divided up as if it 
were the 0 to <1 range in minia- 
ture to code the next symbol, 
and this adds another decimal 
digit to the numbers that give 
the range. For example, if the 
next symbol wasa ‘T’ the lookup 
range would bex>=0.1x<0.3 and 
the current range would have to 
be restricted to between the 10 
percent and 30 percent points, ie 
x>=0.102 x<0.106 — and so on. 


Highs and lows 

With the help of a pencil and 
paper, you should be able to see 
that the coding algorithm for 
each symbol is: 


interval =high-low_| 

high = low+interval*HIGH(symbol) _| 

low = low+interval*LOW(symbol) _| 
Next symbol _| 
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@ How arithmetic compression works 


The coding line is defined 


where HIGH(symbol) and LOW(symbol) repre- 
sent the high and low cut-off points from the 
lookup table. 

When the coding process is over, you have 
two n-digit fractions, the low and high value of 
the interval, which can be uniquely decoded 
back to the symbol string. In fact, any fraction 
within the interval can be used to code the 
sequence of symbols, because any other 
sequence would have produced a different final 
interval. ; 

As the two ranges grow closer together after 
each symbol, a simple method is to output the 
digits in high and low that agree; once they 
agree, they'll never change. This gives a fraction 
with the smallest number of digits that codes for 
the sequence of symbols. 

The decoding process is easy: all you have to 
do is look at the first digit to discover the first 
symbol, then you remove the effect of that sym- 
bol’s interval on the value. For example, decod- 
ing 0.102 clearly gives a ‘T’ as the first symbol 
because it’s in the range 0.1>=x x<0.3. Next, 
removing the effect of this range from the num- 
ber requires subtracting the low range for ‘T’ and 
dividing by the interval for ‘T’, ie subtracting 0.1 
and dividing by 0.2 to give 0.01. The value 0.01 
clearly codes for an ‘S’ because it’s in the range 
0<=x x<.1, and removing the influence of its 
range, ie subtracting 0.0 and dividing by 0.1, 
gives 0.1 which is ‘T’— and so on. 
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The first symbol (T) codes to 0.1<=x<0.3 


Sum of the parts 
Why is this strange cod- 
ing method a good one? 
Well, it assigns a larger 
range to more probable 
symbols. This means it 
generates fewer decimal 
digits for a probable sym- 
bolthan for an improbable 
one. So, as with a Huffman 
code, the number of digits 
(or bits) assigned to a 
symbol depends on how 
often the symbol is likely 
to occur. The difference 
between Huffman and 
arithmetic coding is that 
the arithmetic method 
codes the entire symbol 
string. For example, sup- 
pose you’re compressing 
a black-and-white image, 
with white being 90 per- 
cent probable and black 
10 percent probable. 
Huffman coding would 
assign 0.15 bits to coding 
the white. Of course, you can’t actually assign 
0.15 bits to anything, so, rounding up, you have 
to assign a whole bit. This gives one bit for white 
and one bit for black — no compression. 

Now consider the same situation using arith- 
metic coding. The lookup table would be: 


Low (<=) High (<) 
Black (0) 0.0 0.9 
White (1) 0.9 1.0 


Now consider a string of 100 zeros. Ifyou try cod- 
ing these, you'll find that the low value always 
remains at zero, so the entire string can be rep- 
resented by the binary fraction 0.0. Of course, 
this ignores the need to include an ‘end of mes- 
sage’, but when you include the practicalities the 
final result is still almost as good. You can prove 
that arithmetic coding is optimal even if the num- 
ber of bits to be assigned to a symbol is fraction- 
al, and in this sense it’s better and more general 
than Huffman coding. 


In theory 
Books for smart people 


Just in case you're not a member of, 
and don’t aspire to, the DOS for 
Dummies fraternity, I thought it 
might be nice to tell you about a few 
books that are suitable for beings with above- 
average intelligence. 

If you followed my recent series on 
DDE/OLE, you might want a deeper program- 
mer-oriented view of the subject. There are a cou- 
ple of good books on OLE, but not so many on 
DDE. If you want to add DDE to your programs 
via the DDEML library, try The Windows 
Programmer's Guide to OLE/DDE by JD Clark 
(SAMS), ISBN 0-672-30226. It comes with a disk 
and plenty of programming examples. The final 
two-thirds of the book is about OLE, but it makes 
a good foundation for reading up on OLE 2. 


‘Anice read’ is the best way to describe CMU 
Computer Science: A 25th Anniversary 
Commemorative (ACM Press, distributed by 
Addison-Wesley), ISBN 0201-52899-1. This a col- 
lection of essays from past and present Carnegie 
Mellon University people, including Gorden 
‘Father of the VAX’ Bell, Jon ‘Programming 
Pearls’ Bentley, Zohar ‘I can prove your program 
correct’ Manna, Ivan ‘SketchPad’ Sutherland, 
Allen ‘AI’ Newell and his mate Herbert ‘I do Alas 
well’ Simon-—and many more less well known but 
nevertheless important computer scientists. The 
essays are all very readable, but be warned that 
they're on the technical side and the writers 
always expect you to know something about 
their subjects. The subjects are so varied that 
you're bound to find one or two essays that don’t 
interest you, but most will — ‘Running experi- 
ments with a planar biped’, ‘Why can’t we model 
the physical world?’, ‘Tools for experiments on 
algorithms’, ‘Expressing temporal behaviour 
declaratively’, ‘What is scientifically knowable?’ 
and ‘Technology and courage’ are just a few 
examples from a total of 22. A very adult read... 

Programming and computer science are 
based on, or perhaps have given rise to, their own 
little area of mathematics. It’s a help to know 
something about the maths behind it all, even if 
you don’t have to cram it for an exam, but finding 
the right book is difficult. Mathematical 
Structures for Computer Science isa bit on the dry 
side for casual reading, but it does contain the 
basic grounding you need if you’re going to tack- 
le anything difficult. The topics covered are for- 
mal logic, proofs, recursion and analysis of algo- 
rithms, sets and combinatorics, relations func- 
tions and matrices, graphs and trees, graph algo- 
rithms, boolean algebra and computer logic, 
modelling arithmetic computation and lan- 
guages. The book isvery well organised and neat 
and tidy, but occasionally lacks motivation. You 
might find yourself reading up on something and 
wondering what it has to do with computers. 
Most of the time, however, the connection is 
clear. It finishes up with a look at computability, 
but it doesn’t go very far. If you feel you missed 
out on the mathematical fundamentals for what- 
ever reason, have a look. 

And finally, a book that’s so new and so tech- 
nical that I’m still reading it! Windows Internals 
by Matt Pietrek (Addison Wesley), ISBN 0-201- 
62217-3, tells you how Windows works. It’s avery 
detailed account, but it does give you an overview 
that’s lacking in Pietrek’s earlier book, the very 
valuable Undocumented Windows. If you want to 
know exactly what happens as Windows loads 
itself, for example, you'll find a clear description. 
The scheduler, GDI, message handling, and 
other major areas are also covered. The book 
contains lots of pseudo-code listings to make 
explicit what happens. If you work with Windows 
and want to know more than you get from the offi- 
cial documentation, this is the place to start. My 
only criticism is that the style is unnecessarily 
self-indulgent. I don’t really want to know about 
Matt’s dogs and his 
mother — perhaps 
the dummy spirit is 
getting into the 
more technical US 
stuff as well! my 


Monologue for Windows £89 
lansyst 
(071) 607 5844 


